Nucleotide sequence of ATCC21529 ask (SEQ ID NO: 1)

1 GTGGCCCTGG TCGTACAGAA ATATGGCGGT TCCTCGCTTG AGAGTGCGGA 

51 ACGCATTAGA AACGTCGCTG AACGGATCGT TGCCACCAAG AAGGCTGGAA 

101 ATGATGTCGT GGTTGTCTGC TCCGCAATGG GAGACACCAC GGATGAACTT 

151 CTAGAACTTG CAGCGGCAGT GAATCCCGTT CCGCCAGCTC GTGAAATGGA 

201 TATGCTCCTG ACTGCTGGTG AGCGTATTTC TAACGCTCTC GTCGCCATGG 

251 CTATTGAGTC CCTTGGCGCA GAAGCTCAAT CTTTCACTGG CTCTCAGGCT 

301 GGTGTGCTCA CCACCGAGCG CCACGGAAAC GCACGCATTG TTGACGTCAC 

351 ACCGGGTCGT GTGCGTGAAG CACTCGATGA GGGCAAGATC TGCATTGTTG 

401 CTGGTTTTCA GGGTGTTAAT AAAGAAACCC GCGATGTCAC CACGTTGGGT

451 CGTGGTGGTT CTGACACCAC TGCAGTTGCG TTGGCAGCTG CTTTGAACGC 

501 TGATGTGTGT GAGATTTACT CGGACGTTGA CGGTGTGTAT ACCGCTGACC 

551 CGCGCATCGT TCCTAATGCA CAGAAGCTGG AAAAGCTGAG CTTCGAAGAA 

601 ATGCTGGAAC TTGCTGCTGT TGGCTCCAAG ATTTTGGTGC TGCGCAGTGT 

651 TGAATACGCT CGTGCATTCA ATGTGCCACT TCGCGTACGC TCGTCTTATA 

701 GTAATGATCC CGGCACTTTG ATTGCCGGCT CTATGGAGGA TATTCCTGTG 

751 GAAGAAGCAG TCCTTACCGG TGTCGCAACC GACAAGTCCG AAGCCAAAGT 

801 AACCGTTCTG GGTATTTCCG ATAAGCCAGG CGAGGCTGCC AAGGTTTTCC 

851 GTGCGTTGGC TGATGCAGAA ATCAACATTG ACATGGTTCT GCAGAAcgtc 

901 tcctctgtGG AAGACGGCAC CACCGACATC ACGTTCACCT GCCCTCGCGC

951 TGACGGACGC CGTGCGATGG AGATCTTGAA GAAGCTTCAG GTTCAGGGCA 

1001 ACTGGACCAA TGTGCTTTAC GACGACCAGG TCGGCAAAGT CTCCCTCGTG 

1051 GGTGCTGGCA TGAAGTCTCA CCCAGGTGTT ACCGCAGAGT TCATGGAAGC 

1101 TCTGCGCGAT GTCAACGTGA ACATCGAATT GATTTCCATC TCTGAGATCC 

1151 GCATTTCCGT GCTGATCCGT GAAGATGATC TGGATGCTGC TGCACGTGCA 

1201 TTGCATGAGC AGTTCCAGCT GGGCGGCGAA GACGAAGCCG TCGTTTATGC 

1251 AGGCACCGGA CGCTAA 
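
The grouped layout used in these listings (a start position followed by blocks of ten bases, fifty per line) can be converted to and from a continuous string. The following is a minimal sketch; the function names `parse_listing` and `format_listing` are illustrative and not part of the source. Note that `parse_listing` keeps every letter, so any OCR-damaged characters in a listing would pass through unchanged.

```python
import re


def parse_listing(text: str) -> str:
    """Drop position numbers, spaces, and line breaks; keep only letters."""
    return re.sub(r"[^A-Za-z]", "", text)


def format_listing(seq: str, per_line: int = 50, group: int = 10) -> str:
    """Render a sequence as numbered lines of space-separated blocks."""
    lines = []
    for start in range(0, len(seq), per_line):
        chunk = seq[start:start + per_line]
        groups = [chunk[i:i + group] for i in range(0, len(chunk), group)]
        # The leading number is the 1-based position of the first base on the line.
        lines.append(f"{start + 1} " + " ".join(groups))
    return "\n".join(lines)


# First 60 bases of the ask listing above (SEQ ID NO: 1).
listing = """
1 GTGGCCCTGG TCGTACAGAA ATATGGCGGT TCCTCGCTTG AGAGTGCGGA
51 ACGCATTAGA
"""
seq = parse_listing(listing)
print(len(seq))
print(format_listing(seq))
```

Round-tripping a listing through both functions is a quick way to confirm that a line contains the expected number of bases.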

Amino acid sequence of ATCC21529 ask (SEQ ID NO: 2)

GTGGCCCTGGTCGTACAGAAATATGGCGGTTCCTCGCTTGAGAGTGCGGAACGCATTAGA 
1 + + + + + + 60 

MALVVQKYGGSSLESAERIR 

AACGTCGCTGAACGGATCGTTGCCACCAAGAAGGCTGGAAATGATGTCGTGGTTGTCTGC 
61 + + + + + + 120 

NVAERIVATKKAGNDVVVVC 

TCCGCAATGGGAGACACCACGGATGAACTTCTAGAACTTGCAGCGGCAGTGAATCCCGTT
121 + + + + + + 180

SAMGDTTDELLELAAAVNPV

CCGCCAGCTCGTGAAATGGATATGCTCCTGACTGCTGGTGAGCGTATTTCTAACGCTCTC 

181 + + + +. ... + + 240 

PP AREMDMLLTAGERISNAL 

GTCGCCATGGCTATTGAGTCCCTTGGCGCAGAAGCTCAATCTTTCACTGGCTCTCAGGCT
241 + + + + + + 300

VAMAIESLGAEAQSFTGSQA

GGTGTGCTCACCACCGAGCGCCACGGAAACGCACGCATTGTTGACGTCACACCGGGTCGT 
301 + + + + + + 360 

GVLTTERHGNARIVDVTPGR 

GTGCGTGAAGCACTCGATGAGGGCAAGATCTGCATTGTTGCTGGTTTTCAGGGTGTTAAT
361 + + + + + + 420

VREALDEGKICIVAGFQGVN

AAAGAAACCCGCGATGTCACCACGTTGGGTCGTGGTGGTTCTGACACCACTGCAGTTGCG
421 + + + + + + 480 

KETRDVTTLGRGGSDTTAVA 

TTGGCAGCTGCTTTGAACGCTGATGTGTGTGAGATTTACTCGGACGTTGACGGTGTGTAT
481 + + + + + + 540 

LAAALNADVCEIYSDVDGVY 

ACCGCTGACCCGCGCATCGTTCCTAATGCACAGAAGCTGGAAAAGCTCAGCTTCGAAGAA 
541 + + + + + + 600 

TADPRIVPNAQKLEKLSFEE 

ATGCTGGAACTTGCTGCTGTTGGCTCCAAGATTTTGGTGCTGCGCAGTGTTGAATACGCT
601 + + + + + + 660

MLELAAVGSKILVLRSVEYA

CGTGCATTCAATGTGCCACTTCGCGTACGCTCGTCTTATAGTAATGATCCCGGCACTTTG
661 + + + + + + 720

RAFNVPLRVRSSYSNDPGTL 

ATTGCCGGCTCTATGGAGGATATTCCTGTGGAAGAAGCAGTCCTTACCGGTGTCGCAACC 
721 + + + + + + 780 

IAGSMEDI PVEEAVLTGVAT 

GACAAGTCCGAAGCCAAAGTAACCGTTCTGGGTATTTCCGATAAGCCAGGCGAGGCTGCC 
781 + + ---•+ --+ + + 840 

DKSEA KVTVLGI SDKPGEAA 

AAGGTTTTCCGTGCGTTGGCTGATGCAGAAATCAACATTGACATGGTTCTGCAGAAcgtc
841 + + + + + + 900

KVFRALADAEINIDMVLQNV

tcctctgtGGAAGACGGCACCACCGACATCACGTTCACCTGCCCTCGCGCTGACGGACGC 
901 + + + + + + 960 

SSV EDGTTDITFTCPRADGR 

CGTGCGATGGAGATCTTGAAGAAGCTTCAGGTTCAGGGCAACTGGACCAATGTGCTTTAC
961 + + + + + + 1020

RAMEILKKLQVQGNWTNVLY

GACGACCAGGTCGGCAAAGTCTCCCTCGTGGGTGCTGGCATGAAGTCTCACCCAGGTGTT 
1021 + + + + + + 1080 

DDQVGKVS LVGAGM KSHPGV 

ACCGCAGAGTTCATGGAAGCTCTGCGCGATGTCAACGTGAACATCGAATTGATTTCCATC 
1081 + + + + + + 1140 

TAEFMEA LRDVNVN IELISI 

TCTGAGATCCGCATTTCCGTGCTGATCCGTGAAGATGATCTGGATGCTGCTGCACGTGCA 
1141 ----+ + ---+ + + + 1200 

SEIRISVLIREDDLDAAARA 

TTGCATGAGCAGTTCCAGCTGGGCGGCGAAGACGAAGCCGTCGTTTATGCAGGCACCGGA 
1201 + + + + + + 1260 

LHEQFQLGGEDEAVVYAGTG 

CGCTAA 
1261 1266 



R * 
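
The translation blocks pair each 60-base row with its 20 residues. As a hedged illustration of how such a row can be checked, the sketch below translates a reading frame using the standard genetic code and renders a GTG or ATG codon in the first position as M, as the ask block above does for its GTG start. The names `CODON_TABLE` and `translate` are illustrative and not taken from the source.

```python
BASES = "TCAG"
# Standard genetic code in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    dna = dna.upper()  # the listings mix upper and lower case
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        residue = CODON_TABLE.get(codon, "X")  # X for unreadable codons
        # A start codon at position 1 is read as Met, including GTG.
        if i == 0 and codon in {"ATG", "GTG"}:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First row of the ask translation block (bases 1-60).
row = "GTGGCCCTGGTCGTACAGAAATATGGCGGTTCCTCGCTTGAGAGTGCGGAACGCATTAGA"
print(translate(row))  # MALVVQKYGGSSLESAERIR
```

Only the first codon of the whole coding sequence should get the start-codon treatment, so when checking a later row in isolation an internal GTG or ATG in position 1 would need to be translated with the plain table instead.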

Nucleotide sequence of ATCC21529 asd (SEQ ID NO: 3)



1 ATGACCACCA TCGCAGTTGT TGGTGCAACC GGCCAGGTCG GCCAGGTTAT 

51 GCGCACCTTT TTGGAAGAGC GCAATTTCCC AGCTGACACT GTTCGTTTCT 

101 TTGCTTCCCC GCGTTCCGCA GGCCGTAAGA TTGAATTCCG TGGCACGGAA 

151 ATCGAGGTAG AAGACATTAC TCAGGCAACC GAGGAGTCCC TCAAGGGCAT 

201 CGACGTTGCG TTGTTCTCTG CTGGAGGCAC CGCTTCCAAG CAGTACGCTC 

251 CACTGTTTGC TGCTGCAGGC GCGACTGTTG TGGATAACTC TTCTGCTTGG 

301 CGCAAGGACG ACGAGGTTCC ACTAATCGTC TCTGAGGTGA ACCCTTCCGA 

351 CAAGGATTCC CTGGTCAAGG GCATTATTGC GAATCCTAAC TGCACCACCA 

401 TGGCTGCAAT GCCAGTGCTG AAGCCACTGC ACGATGCCGC TGGTCTTGTA 

451 AAGCTTCACG TTTCCTCTTA CCAGGCTGTT TCCGGTTCTG GTCTTGCAGG 

501 TGTGGAAACC TTGGCAAAGC AGGTTGCTGC AGTTGGCGAC CACAACGTTG 

551 AGTTCGTCCA TGATGGACAG GCTGCTGACG CAGGCGATGT CGGACCTTAC 

601 GTTTCCCCAA TCGCTTACAA CGTGCTGCCA TTCGCCGGAA ACCTCGTCGA 

651 TGACGGCACC TTCGAAACCG ACGAAGAGCA GAAGCTGCGC AACGAATCCC 

701 GCAAGATTCT CGGCCTCCCA GACCTCAAGG TCTCAGGCAC CTGCGTCCGC 

751 GTGCCGGTTT TCACCGGCCA CACGCTGACC ATTCACGCCG AATTCGACAA 

801 GGCAATCACC GTCGAGCAGG CGCAGGAGAT CTTGGGTGCC GCTTCAGGCG 

851 TCGAGCTTGT CGACGTCCCA ACCCCACTTG CAGCTGCCGG CATTGACGAA 

901 TCCCTCGTTG GACGCATCCG TCAGGACTCC ACTGTCGACG ACAACCGCGG 

951 TCTGGTTCTC GTCGTATCTG GCGATAACCT TCGCAAGGGC GCAGCACTGA 

1001 ACACCATTCA GATTGCTGAG CTGCTGGTTA AGTAA 

Amino acid sequence of ATCC21529 asd (SEQ ID NO: 4)

ATGACCACCATCGCAGTTGTTGGTGCAACCGGCCAGGTCGGCCAGGTTATGCGCACCTTT 
1 + + + + + + 60 

MTTIAVVGATG QVGQVMRTF 

TTGGAAGAGCGCAATTTCCCAGCTGACACTGTTCGTTTCTTTGCTTCCCCGCGTTCCGCA
61 + + + + + + 120 

LEERNFPADTVRFFASPRSA 

GGCCGTAAGATTGAATTCCGTGGCACGGAAATCGAGGTAGAAGACATTACTCAGGCAACC 
121 + + + + + + 180 

GRKIEFRGTEIEVEDITQAT

GAGGAGTCCCTCAAGGGCATCGACGTTGCGTTGTTCTCTGCTGGAGGCACCGCTTCCAAG 
181 + + + +-- + --+ 240 

EESLKG I DVALFSAGGTA SK 

CAGTACGCTCCACTGTTTGCTGCTGCAGGCGCGACTGTTGTGGATAACTCTTCTGCTTGG 
241 --- + + +-- + + --+300 

QYAP LFAAAGATVVDNSSAW 

CGCAAGGACGACGAGGTTCCACTAATCGTCTCTGAGGTGAACCCTTCCGACAAGGATTCC 
301 + + + + + + 360 

RKDDEVPLIVSEVNPSDKD S 

CTGGTCAAGGGCATTATTGCGAATCCTAACTGCACCACCATGGCTGCAATGCCAGTGCTG 
361 + +- + + + + 420 

LVKG I IANPNCTTMAA MPV L 

AAGCCACTGCACGATGCCGCTGGTCTTGTAAAGCTTCACGTTTCCTCTTACCAGGCTGTT
421 + + + + + + 480 

KPLHDAAGLVKLHVSSYQAV 

TCCGGTTCTGGTCTTGCAGGTGTGGAAACCTTGGCAAAGCAGGTTGCTGCAGTTGGCGAC 
481 + + + + + + 540 

SGSGLAGVETLAKQVAAVGD 

CACAACGTTGAGTTCGTCCATGATGGACAGGCTGCTGACGCAGGCGATGTCGGACCTTAC
541 + + + + +-- + 600 

HNVEFVHDGQAADAGDVGPY 

GTTTCCCCAATCGCTTACAACGTGCTGCCATTCGCCGGAAACCTCGTCGATGACGGCACC 
601 + + + + + + 660 

VSPIAYNVLPFAGNLVDDGT 

TTCGAAACCGACGAAGAGCAGAAGCTGCGCAACGAATCCCGCAAGATTCTCGGCCTCCCA 
661 + + + + + + 720 

FETD EEQKLRNESRKI L GLP 

GACCTCAAGGTCTCAGGCACCTGCGTCCGCGTGCCGGTTTTCACCGGCCACACGCTGACC 
721 + + + + ---+ + 780 

DLKV SGTCVRV P V F T G H T L T 

ATTCACGCCGAATTCGACAAGGCAATCACCGTCGAGCAGGCGCAGGAGATCTTGGGTGCC
781 + + + + --+ + 840 

I HAEFDKA ITVEQAQE I LGA 

GCTTCAGGCGTCGAGCTTGTCGACGTCCCAACCCCACTTGCAGCTGCCGGCATTGACGAA
841 + + + + + + 900

ASGVELVDVPTPLAAAGIDE

TCCCTCGTTGGACGCATCCGTCAGGACTCCACTGTCGACGACAACCGCGGTCTGGTTCTC 
901 + + + + + + 960 

SLVGRIRQDSTVDDN RGLVL 

GTCGTATCTGGCGATAACCTTCGCAAGGGCGCAGCACTGAACACCATTCAGATTGCTGAG 
961 + + + + + + 1020 

VVSGDNLRKGAALNTIQIAE

CTGCTGGTTAAGTAA 
1021 + 1035 

L L V K * 

Nucleotide sequence of dapA (SEQ ID NO: 5)



1 ATGAGCACAG GTTTAACAGC TAAGACCGGA GTAGAGCACT TCGGCACCGT 

51 TGGAGTAGCA ATGGTTACTC CATTCACGGA ATCCGGAGAC ATCGATATCG 

101 CTGCTGGCCG CGAAGTCGCG GCTTATTTGG TTGATAAGGG CTTGGATTCT

151 TTGGTTCTCG CGGGCACCAC TGGTGAATCC CCAACGACAA CCGCCGCTGA 

201 AAAACTAGAA CTGCTCAAGG CCGTTCGTGA GGAAGTTGGG GATCGGGCGA 

251 AGCTCATCGC CGGTGTCGGA ACCAACAACA CGCGGACATC TGTGGAACTT 

301 GCGGAAGCTG CTGCTTCTGC TGGCGCAGAC GGCCTTTTAG TTGTAACTCC 

351 TTATTACTCC AAGCCGAGCC AAGAGGGATT GCTGGCGCAC TTCGGTGCAA 

401 TTGCTGCAGC AACAGAGGTT CCAATTTGTC TCTATGACAT TCCTGGTCGG 

451 TCAGGTATTC CAATTGAATC TGATACCATG AGACGCCTGA GTGAATTACC 

501 TACGATTTTG GCGGTCAAGG ACGCCAAGGG TGACCTCGTT GCAGCCACGT 

551 CATTGATCAA AGAAACGGGA CTTGCCTGGT ATTCAGGCGA TGACCCACTA 

601 AACCTTGTTT GGCTTGCTTT GGGCGGATCA GGTTTCATTT CCGTAATTGG 

651 ACATGCAGCC CCCACAGCAT TACGTGAGTT GTACACAAGC TTCGAGGAAG

701 GCGACCTCGT CCGTGCGCGG GAAATCAACG CCAAACTATC ACCGCTGGTA 

751 GCTGCCCAAG GTCGCTTGGG TGGAGTCAGC TTGGCAAAAG CTGCTcTGCG 

801 TCTGCAGGGC ATCAACGTAG GAGATCCTCG ACTTCCAATT ATGGCTCCAA 

851 ATGAGCAGGA ACTTGAGGCT CTCCGAGAAG ACATGAAAAA AGCTGGAGTT 

901 CTATAA 

Amino acid sequence of dapA (SEQ ID NO: 6) 

ATGAGCACAGGTTTAACAGCTAAGACCGGAGTAGAGCACTTCGGCACCGTTGGAGTAGCA 
1 + + + + + + 60 

MSTGLTAKTGVEHFGTVGVA 

ATGGTTACTCCATTCACGGAATCCGGAGACATCGATATCGCTGCTGGCCGCGAAGTCGCG 
61 + + + + +--- + 120 

M V TP FTESGD I D IAAGRE VA 

GCTTATTTGGTTGATAAGGGCTTGGATTCTTTGGTTCTCGCGGGCACCACTGGTGAATCC
121 + + + + + + 180

AYLVDKGLDSLVLAGTTGES

CCAACGACAACCGCCGCTGAAAAACTAGAACTGCTCAAGGCCGTTCGTGAGGAAGTTGGG
181 + + + + + + 240

PTTTAAEKLELLKAVREEVG

GATCGGGCGAAGCTCATCGCCGGTGTCGGAACCAACAACACGCGGACATCTGTGGAACTT 
241 + + + + + + 300

DRAKLIAGVGTNNTRTSVEL

GCGGAAGCTGCTGCTTCTGCTGGCGCAGACGGCCTTTTAGTTGTAACTCCTTATTACTCC
301 + + + + + + 360

AEAAASAGADGLLVVTPYYS

AAGCCGAGCCAAGAGGGATTGCTGGCGCACTTCGGTGCAATTGCTGCAGCAACAGAGGTT 
361 + + + .-+ + + 420 

KPSQEGLLAHFGAIAAATEV 

CCAATTTGTCTCTATGACATTCCTGGTCGGTCAGGTATTCCAATTGAATCTGATACCATG
421 + + + + + + 480

PICLYDIPGRSGIPIESDTM 

AGACGCCTGAGTGAATTACCTACGATTTTGGCGGTCAAGGACGCCAAGGGTGACCTCGTT
481 + + + + + + 540 

RRLSELPTILAVKDAKGDLV 

GCAGCCACGTCATTGATCAAAGAAACGGGACTTGCCTGGTATTCAGGCGATGACCCACTA 
541 + + + + + + 600 

AATS LI KETG LAW Y SGD D PL 

AACCTTGTTTGGCTTGCTTTGGGCGGATCAGGTTTCATTTCCGTAATTGGACATGCAGCC
601 + + + + + + 660

NLVWLALGGSGFISVIGHAA

CCCACAGCATTACGTGAGTTGTACACAAGCTTCGAGGAAGGCGACCTCGTCCGTGCGCGG
661 + + + + + + 720

PTALRELYTSFEEGDLVRAR

GAAATCAACGCCAAACTATCACCGCTGGTAGCTGCCCAAGGTCGCTTGGGTGGAGTCAGC 
721 + + + + + + 780 

E I NAKL SP LVAAQGR LGG V S 

TTGGCAAAAGCTGCTctGCGTCTGCAGGGCATCAACGTAGGAGATCCTCGACTTCCAATT 
781 + +--- + + + + 840 

LAKAALRLQGI NVGDPRLPI 

ATGGCTCCAAATGAGCAGGAACTTGAGGCTCTCCGAGAAGACATGAAAAAAGCTGGAGTT 
841 + + + + + + 900 

MAPNEQELEALREDMKKAGV 

CTATAA 
901 906 

L *

Nucleotide sequence of dapB (SEQ ID NO: 7)

1 ATGGGAATCA AGGTTGGCGT TCTCGGAGCC AAAGGCCGTG TTGGTCAAAC 

51 TATTGTGGCA GCAGTCAATG AGTCCGACGA TCTGGAGCTT GTTGCAGAGA 

101 TCGGCGTCGA CGATGATTTG AGCCTTCTGG TAGACAACGG CGCTGAAGTT 

151 GTCGTTGACT TCACCACTCC TAACGCTGTG ATGGGCAACC TGGAGTTCTG 

201 CATCAACAAC GGCATTTCtG CGGTTGTTGG AACCACGGGC TTCGATaATG 

251 CTCGTTTGGA GCAGGTTCGC GcCTGGCTTG AAGGAAAAGA CAATGTCGGT 

301 GTTCTGATCG CACCTAACTT TGCTATCTCT GCGGTGTTGA CCATGGTCTT 

351 TTCCAAGCAG GCTGCCCGCT TCTTCGAATC AGCTGAAGTT ATTGAGCTGC

401 ACCACCCCAA CAAGCTGGAT GCACCTTCAG GCACCGCGAT CCACACTGCT 

451 CAGGGCATTG CTGCGGCACG CAAAGAAGCA GGCATGGACG CACAGCCAGA 

501 TGCGACCGAG CAGGCACTTG AGGGTTCCCG TGGCGCAAGC GTAGATGGAA 

551 TCCCaGTTCA cGCAGTCCGC ATGTCCGGCA TGGTTGCTCA CGAGCAAGTT 

601 ATCTTTGGCA CCCAGGGTCA GACCTTGACC ATCAAGCAGG ACTCCTATGA 

651 TCGCAACTCA TTTGCACCAG GTGTCTTGGT GGGTGTGCGC AACATTGCAC 

701 AGCACCCAGG CCTAGTCGTA GGACTTGAGC ATTACCTAGG CCTGTAA 

Amino acid sequence of dapB (SEQ ID NO: 8)

ATGGGAATCAAGGTTGGCGTTCTCGGAGCCAAAGGCCGTGTTGGTCAAACTATTGTGGCA 
1 + + + + + + 60

MGIKVGVLGAKGRVGQTIVA 

GCAGTCAATGAGTCCGACGATCTGGAGCTTGTTGCAGAGATCGGCGTCGACGATGATTTG 
61 + + + + + + 120

AVNESDDLELVAEIGVDDDL

AGCCTTCTGGTAGACAACGGCGCTGAAGTTGTCGTTGACTTCACCACTCCTAACGCTGTG 
121 - ------- - + + ---+- + "--+ + 180 

S L L VD N G A E V V V D F TT P N AV 

ATGGGCAACCTGGAGTTCTGCATCAACAACGGCATTTCTGCGGTTGTTGGAACCACGGGC 
181 ---- + +- +---......+ + 240 

M G N L E FC I N N G I SAV VG TTG 

TTCGATaATGCTCGTTTGGAGCAGGTTCGCGcCTGGCTTGAAGGAAAAGACAATGTCGGT 
241 + + + + + + 300 

FDNARLEQVRAWLEG KDNVG 

GTTCTGATCGCACCTAACTTTGCTATCTCTGCGGTGTTGACCATGGTCTTTTCCAAGCAG
301 + + + + + + 360 

VLIAPNFAISAVLTMVF SKQ 

GCTGCCCGCTTCTTCGAATCAGCTGAAGTTATTGAGCTGCACCACCCCAACAAGCTGGAT 
361 + - + + --+ + + 420 

AA RFFESAEVIELHHPNKLD 

GCACCTTCAGGCACCGCGATCCACACTGCTCAGGGCATTGCTGCGGCACGCAAAGAAGCA
421 + + + + + + 480

APSGTAIHTAQGIAAARKEA 

GGCATGGACGCACAGCCAGATGCGACCGAGCAGGCACTTGAGGGTTCCCGTGGCGCAAGC 
481 + + + + + + 540

GMDAQPDATEQALEGSRGAS 

GTAGATGGAATCCCaGTTCAcGCAGTCCGCATGTCCGGCATGGTTGCTCACGAGCAAGTT 
541 + + + + + + 600

VDGIPVHAVRMSGMVAHEQV

ATCTTTGGCACCCAGGGTCAGACCTTGACCATCAAGCAGGACTCCTATGATCGCAACTCA 
601 + +---■ + + + ...+ 660 

I FGTQGQTLTIKQDSY DRNS 

TTTGCACCAGGTGTCTTGGTGGGTGTGCGCAACATTGCACAGCACCCAGGCCTAGTCGTA 
661 --+ + + + + + 720 

FAPGVLVGVRNIAQHPGLVV

GGACTTGAGCATTACCTAGGCCTGTAA 
721 + + 747 

GLEHYLGL* 

Nucleotide sequence of ddh (SEQ ID NO: 9)



1 ATGCATTTCG GTAAGCTCGA CCAGGACAGT GCCACCACAA TTTTGGAGGA 

51 TTACAAGAAC ATGACCAACA TCCGCGTAGC TATCGTaGGC TACGGAAACC 

101 TGGGACGCAG CGTCGAAAAG CTTATTGCCA AGCAGCCCGA CATGGACCTT 

151 GTAGGAATCT TCTCGCGCCG GGCCACCCTC GACACAAAGA CGCCAGTCTT 

201 TGATGTCGCC GACGTGGACA AGCACGCGGA CGACGTGGAC GTGCTGTTCC 

251 TGTGCATGGG CTCCGCCACC GACATCCCTG AGCAGGCACC AAAGTTCGCG 

301 CAGTTCGCCT GCACCGTAGA CACCTACGAC AACCACCGCG ACATCCCACG 

351 CCACCGCCAG GTCATGAACG AAGCCGCCAC CGCAGCCGGC AACGTTGCAC

401 TGGTCTCTAC CGGCTGGGAT CCAGGAATGT TCTCCATCAA CCGCGTCTAC 

451 GCAGCGGCAG TCTTAGCCGA GCACCAGCAG CACACCTTCT GGGGCCCAGG 

501 TTTGTCACAG GGCCACTCCG ATGCTTTGCG ACGCATCCCT GGCGTTCAAA 

551 AGGCcGTCCA GTACACCCTC CCATCCGAAG AaGCCCTGGA AAAGGCCCGC 

601 CGTGGCGAAG CCGGCGACCT cACCGGAAAG CAAACCCACA AGCGCCAATG 

651 CTTCGTGGTT GCCGACGCGG CCGAcCACGA GCGCATCGAA AACGACATCC 

701 GCACCATGCC TGATTACTTC GTTGGCTACG AAGTCGAAGT CAACTTCATC 

751 GACGAAGCAA CGTTgGACgC CGAGCACACC GGCATGCCAC ACGGcGGaCA 

801 CGTGATcACC ACCGGCGACA CCGGTGGCTT CAACCACACC GTGGAATACA 

851 TCCTgAAGCT GGACCGAAAC CCAGATTTCA CCGCTTCtTC ACAGATCGCT 

901 TTCGGcCGCG CAGCTCACCG CATGAAGCAG CAGGGCCAAA GCGGtGCTTT 

951 CACCGTCCTC GAAGTTGCTC CATACtTGCT CTCCCCgGAG AACTTGGAtG 

1001 ATCTGATCGC ACGCGACGTC TAA 

Amino acid sequence of ddh (SEQ ID NO: 10)

ATGCATTTCGGTAAGCTCGACCAGGACAGTGCCACCACAATTTTGGAGGATTACAAGAAC
1 + + + + + + 60

MHFGKLDQDSATTILEDYKN

ATGACCAACATCCGCGTAGCTATCGTaGGCTACGGAAACCTGGGACGCAGCGTCGAAAAG 
61 + + + + + + 120 

MTNIRVAIVG Y G N L G R S V E K 

CTTATTGCCAAGCAGCCCGACATGGACCTTGTAGGAATCTTCTCGCGCCGGGCCACCCTC 
121 + + + + + + 180 

L IAKQPDMD LVGI FSR RAT L 

GACACAAAGACGCCAGTCTTTGATGTCGCCGACGTGGACAAGCACGCCGACGACGTGGAC 
181 + + + + + + 240 

DTKTPVFDVADVDKHADDVD 

GTGCTGTTCCTGTGCATGGGCTCCGCCACCGACATCCCTGAGCAGGCACCAAAGTTCGCG 
241 + + + + -+ + 300 

VLF LCMGSATD I PEQA PK F A 

CAGTTCGCCTGCACCGTAGACACCTACGACAACCACCGCGACATCCCACGCCACCGCCAG 
301 + + + +---. + + 360 

QFACTVDTYDNHRD I P R H R Q 

GTCATGAACGAAGCCGCCACCGCAGCCGGCAACGTTGCACTGGTCTCTACCGGCTGGGAT 
361 + + + + + + 420 

V MNEAATAAGNVALVSTGWD 

CCAGGAATGTTCTCCATCAACCGCGTCTACGCAGCGGCAGTCTTAGCCGAGCACCAGCAG 
421 + + + ---+-• + + 480 

PGMFSINRVYAAAVLAEHQQ 

CACACCTTCTGGGGCCCAGGTTTGTCACAGGGCCACTCCGATGCTTTGCGACGCATCCCT 
481 + + + + + + 540 

HTFWGPGLSQGHSDALRRIP 

GGCGTTCAAAAGGCcGTCCAGTACACCCTCCCATCCGAAGAaGCCCTGGAAAAGGCCCGC
541 + + + + + + 600 

GVQKAVQYTLPSEEALEKAR 

CGTGGCGAAGCCGGCGACCTcACCGGAAAGCAAACCCACAAGCGCCAATGCTTCGTGGTT 
601 + + + + + + 660 

RGEAGDLTGKQTHKRQCFVV 

GCCGACGCGGCCGAcCACGAGCGCATCGAAAACGACATCCGCACCATGCCTGATTACTTC 
661 + + + ---+ + + 720 

ADAADHERIENDIRTMPDYF

GTTGGCTACGAAGTCGAAGTCAACTTCATCGACGAAGCAACCTTgGACgCCGAGCACACC 
721 + + + + + + 780 

VGYEVEVNFIDEATLDAEHT

GGCATGCCACACGGcGGaCACGTGATcACCACCGGCGACACCGGTGGCTTCAACCACACC 
781 + + + + + + 840 

GMPHGGHVITTGDTGGFNHT 

GTGGAATACATCCTgAAGCTGGACCGAAACCCAGATTTCACCGCTTCtTCACAGATCGCT 
841 + + + + + + 900

VEYI LKLDRNPDFTASSQIA 

TTCGGcCGCGCAGCTCACCGCATGAAGCAGCAGGGCCAAAGCGGtGCTTTCACCGTCCTC 
901 + + + + + + 960

FGRAAHRMKQQGQSGAFTVL 

GAAGTTGCTCCATACtTGCTCTCCCCgGAGAACTTGGAtGATCTGATCGCACGCGACGTC 
961 + + + + + + 1020 

EVAPYLLSPENLDDLIARDV 
TAA 

1021 --- 1023 

FIG.1 1B 



Sequence of full length LysA from NRRL B-11474 (SEQ ID NO: 11): 
Underlined region: the priming site for lysA primer 

ATGGCTACAGTTGAAAATTTCAATGAACTTCCCGCACACGTATGGCCACGCAATGCAGTG

CGCCAAGAAGACGGCGTTGTCACCGTCGCTGGTGTGCCTCTGCCTGACCTCGCTGAAGAA 

TACGGAACCCCACTGTTCGTAGTCGACGAGGACGATTTCCGTTCCCGCTGTCGCGACATG 

GCTACCGCATTCGGTGGACCAGGCAATGTGCACTACGCATCCAAAGCGTTCCTGACCAAG 

ACCATTGCACGTTGGGTTGATGAAGAGGGGCTGGCACTGGACATTGCGTCCATCAATGAA

CTGGGCATTGCCCTGGCCGCTGGTTTCCCGGCCAGCCGTATCACCGCGCACGGCAACAAC 

AAAGGCGTAGAGTTCCTGCGCGCGTTGGTTCAAAACGGTGTCGGGCATGTGGTGCTGGAC 

TCCGCGCAGGAATTGGAACTGCTGGATTACGTTGCCGCTGGTGAAGGCAAGATCCAGGAC 

GTGTTGATCCGCGTGAAGCCAGGTATCGAAGCCCACACCCACGAGTTCATCGCCACTAGC 

CACGAAGACCAGAAGTTCGGATTCTCCCTGGCATCCGGTTCCGCATTCGAAGCAGCGAAA

GCAGCCAACAATGCAGAGAACTTGAACCTGGTTGGTCTGCACTGCCATGTTGGTTCCCAG

GTGTTCGACGCCGAAGGCTTCAAGCTGGCAGCAGAGCGCGTGTTGGGCCTGTACTCACAG 

ATCCACAGCGAACTAGGTGTCGCCCTTCCTGAGCTGGACCTCGGTGGCGGATACGGCATC

GCCTACACTGCAGATGAGGAACCACTCAACGTCGCAGAAGTCGCCTCCGACCTACTCACC

GCAGTCGGAAAAATGGCAGCGGAACTAGGCATCGACGCACCAACCGTGCTTGTTGAGCCC

GGCCGCGCTATCGCAGGCCCCTCCACCGTGACCATCTACGAAGTCGGCACCACCAAAAAC 

GTCCACGTAGACGACGACAAAACCCGCCGCTACGTAGCCGTCGACGGAGGCATGTCCGAC 

AACATCCGCCCAGCACTCTACGGCTCCGAATACGACGCCCGCGTAGTATCCCGCTTCGCC 

GAAGGAGACCCAGTAAGCACCCGCATCGTGGGCTCCCACTGCGAATCCGGCGATATCCTG 

ATCAACGATGAAATCTACCCATCTGAGATCACCAGCGGCGACTTCCTCGCACTCGCAGCC 

ACCGGCGCATACTGCTACGCCATGAGCTCCCGCTACAACGCCTTCACACGGCCCGCCGTC 

GTGTCCGTCCGCGCTGGCAGCTCCCGCCTCATGCTGCGCCGCGAAACCCTCGACGACATC 

CTCTCACTAGAGGCATAA 

Full length sequence of LysA (NRRL-B11474)
DIAMINOPIMELATE DECARBOXYLASE (Lys A) (SEQ ID NO: 12) 

MATVENFNELPAHVWPRNAVRQEDGVVTVAGVPLPDLAEEYGTPLFVVDEDDFRSRCRDM
ATAFGGPGNVHYASKAFLTKTIARWVDEEGLALDIASINELGIALAAGFPASRITAHGNN
KGVEFLRALVQNGVGHVVLDSAQELELLDYVAAGEGKIQDVLIRVKPGIEAHTHEFIATS
HEDQKFGFSLASGSAFEAAKAANNAENLNLVGLHCHVGSQVFDAEGFKLAAERVLGLYSQ
IHSELGVALPELDLGGGYGIAYTADEEPLNVAEVASDLLTAVGKMAAELGIDAPTVLVEP
GRAIAGPSTVTIYEVGTTKNVHVDDDKTRRYVAVDGGMSDNIRPALYGSEYDARVVSRFA
EGDPVSTRIVGSHCESGDILINDEIYPSDITSGDFLALAATGAYCYAMSSRYNAFTRPAV
VSVRAGSSRLMLRRETLDDILSLEA

Nucleotide sequence of AS019 lysA (SEQ ID NO: 13) (pRS6)



1 ATGGCTACAG TTGAAAATTT CAATGAACTT CCCGCACACG TATGGCCACG 

51 CAATGCCGTG CGCCAAGAAG ACGGCGTTGT CACCGTCGCT GGTGTGCCTC 

101 TGCCTGACCT CGCTGAAGAA TACGGAACCC CACTGTTCGT AGTCGACGAG 

151 GACGATTTCC GTTCCCGCTG TCGCGACATG GCTACCGCAT TCGGTGGACC 

201 AGGCAATGTG CACTACGCAT CTAAAGCGTT CCTGACCAAG ACCATTGCAC 

251 GTTGGGTTGA TGAAGAGGGG CTGGCACTGG ACATTGCATC CATCAACGAA 

301 CTGGGCATTG CCCTGGCCGC TGGTTTCCCC GCCAGCCGTA TCACCGCGCA 

351 CGGCAACAAC AAAGGCGTAG AGTTCCTGCG CGCGTTGGTT CAAAACGGTG 

401 TGGGACACGT GGTGCTGGAC TCCGCACAGG AACTAGAACT GTTGGATTAC 

451 GTTGCCGCTG GTGAAGGCAA GATTCAGGAC GTGTTGATCC GCGTAAAGCC 

501 AGGCATCGAA GCACACACCC ACGAGTTCAT CGCCACTAGC CACGAAGACC

551 AGAAGTTCGG ATTCTCCCTG GCATCCGGTT CCGCATTCGA AGCAGCAAAA 

601 GCCGCCAACA ACGCAGAAAA CCTGAACCTG GTTGGCCTGC ACTGCCACGT 

651 TGGTTCCCAG GTGTTCGACG CCGAAGGCTT CAAGCTGGCA GCAGAACGCG 

701 TGTTGGGCCT GTACTCACAG ATCCACAGCG AACTGGGCGT TGCCCTTCCT 

751 GAACTGGATC TCGGTGGCGG ATACGGCATT GCCTATACCG CAGCTGAAGA 

801 ACCACTCAAC GTCGCAGAAG TTGCCTCCGA CCTGCTCACC GCAGTCGGAA 

851 AAATGGCAGC GGAACTAGGC ATCGACGCAC CAACCGTGCT TGTTGAGCCC 

901 GGCCGCGCTA TCGCAGGCCC CTCCACCGTG ACCATCTACG AAGTCGGCAC 

951 CACCAAAGAC GTCCACGTAG ACGACGACAA AACCCGCCGT TACATCGCCG 

1001 TGGACGGAGG CATGTCCGAC AACATCCGCC CAGCACTCTA CGGCTCCGAA 

1051 TACGACGCCC GCGTAGTATC CCGCTTCGCC GAAGGAGACC CAGTAAGCAC 

1101 CCGCATCGTG GGCTCCCACT GCGAATCCGG CGATATCCTG ATCAACGATG 

1151 AAATCTACCC ATCTGACATC ACCAGCGGCG ACTTCCTTGC ACTCGCAGCC 

1201 ACCGGCGCAT ACTGCTACGC CATGAGCTCC CGCTACAACG CCTTCACACG 

1251 GCCCGCCGTC GTGTCCGTCC GCGCTGGCAG CTCCCGCCTC ATGCTGCGCC 

1301 GCGAAACGCT CGACGACATC CTCTCACTAG AGGCATAA 

Full length amino acid sequence of lysA (pRS6) (SEQ ID NO: 14)

ATGGCTACAGTTGAAAATTTCAATGAACTTCCCGCACACGTATGGCCACGCAATGCCGTG 
1 + + + + + + 60 

MATVENFNELPAHVWPRNAV 

CGCCAAGAAGACGGCGTTGTCACCGTCGCTGGTGTGCCTCTGCCTGACCTCGCTGAAGAA 
61 + + + + + + 120 

R Q E D G V V T V A G V P L P D L A E E 

TACGGAACCCCACTGTTCGTAGTCGACGAGGACGATTTCCGTTCCCGCTGTCGCGACATG
121 + + + + + + 180

YGTPLFVVDEDDFRSRCRDM

GCTACCGCATTCGGTGGACCAGGCAATGTGCACTACGCATCTAAAGCGTTCCTGACCAAG 
181 + + ---+ + + + 240 

AT A F GGPGNVHYASKAFLTK 

ACCATTGCACGTTGGGTTGATGAAGAGGGGCTGGCACTGGACATTGCATCCATCAACGAA 
241 + + + + + + 300 

TIA RWV DEEGLALDIASINE 

CTGGGCATTGCCCTGGCCGCTGGTTTCCCCGCCAGCCGTATCACCGCGCACGGCAACAAC
301 + + + + + + 360

LGIALAAGFPASRITAHGNN

AAAGGCGTAGAGTTCCTGCGCGCGTTGGTTCAAAACGGTGTGGGACACGTGGTGCTGGAC 
361 + + + + +-- + 420 

KGVEFLRALVQNGVGHVVLD 

TCCGCACAGGAACTAGAACTGTTGGATTACGTTGCCGCTGGTGAAGGCAAGATTCAGGAC 
421 + + + + + + 480 

SAQE LE LLDYVAAG EG K I QD 

FIG.15A 



GTGTTGATCCGCGTAAAGCCAGGCATCGAAGCACACACCCACGAGTTCATCGCCACTAGC 
481 + + + + + + 540 

V L I RVKPG I EA HTHE F IATS 

CACGAAGACCAGAAGTTCGGATTCTCCCTGGCATCCGGTTCCGCATTCGAAGCAGCAAAA
541 + + + + + + 600

HEDQKFGFSLASGSAFEAAK 

GCCGCCAACAACGCAGAAAACCTGAACCTGGTTGGCCTGCACTGCCACGTTGGTTCCCAG 
601 + + + + + + 660 

AANNAENLNLVGLHCHVGSQ 

GTGTTCGACGCCGAAGGCTTCAAGCTGGCAGCAGAACGCGTGTTGGGCCTGTACTCACAG
661 + + + + + + 720

VFDAEGFKLAAERVLGLYSQ

ATCCACAGCGAACTGGGCGTTGCCCTTCCTGAACTGGATCTCGGTGGCGGATACGGCATT
721 + + + + + + 780

IHSELGVALPELDLGGGYGI

GCCTATACCGCAGCTGAAGAACCACTCAACGTCGCAGAAGTTGCCTCCGACCTGCTCACC 
781 + + + + +--- + 840 

AYTAAEEP LNVAEVAS DLLT 

GCAGTCGGAAAAATGGCAGCGGAACTAGGCATCGACGCACCAACCGTGCTTGTTGAGCCC 
841 + + + + + + 900

AVGKMAAELGIDAPTVLVEP

GGCCGCGCTATCGCAGGCCCCTCCACCGTGACCATCTACGAAGTCGGCACCACCAAAGAC 
901 + + + + + + 960 

GRAIAGPSTVTIYEVGTTKD 

GTCCACGTAGACGACGACAAAACCCGCCGTTACATCGCCGTGGACGGAGGCATGTCCGAC
961 + + + + + + 1020 

VHVDDDKTRRYIAVDGGMSD 

AACATCCGCCCAGCACTCTACGGCTCCGAATACGACGCCCGCGTAGTATCCCGCTTCGCC 
1021 + + + + + + 1080 

NIRPALYGSEYDARVVSRFA 

GAAGGAGACCCAGTAAGCACCCGCATCGTGGGCTCCCACTGCGAATCCGGCGATATCCTG
1081 + + + + + + 1140

EGDPVSTRIVGSHCESGDIL

ATCAACGATGAAATCTACCCATCTGACATCACCAGCGGCGACTTCCTTGCACTCGCAGCC 
1141 --- + + + + + ----+ 1200 

INDEI YPSDITSGDFLALAA 

ACCGGCGCATACTGCTACGCCATGAGCTCCCGCTACAACGCCTTCACACGGCCCGCCGTC 
1201 + + + + + + 1260 

TGAYC YAM S SR Y NAFTR PAV 

GTGTCCGTCCGCGCTGGCAGCTCCCGCCTCATGCTGCGCCGCGAAACGCTCGACGACATC 
1261 + + + + + + 1320 

VSVRAGSSRLML RRETLDDI 

CTCTCACTAGAGGCATAA 
1321 + 1338 

L S L E A * 

Nucleotide sequence of orf2 in dapBA operon (SEQ ID NO: 15)



1 GTGGCCGAAC AAGTTAAATT GAGCGTGGAG TTGATAGCGT GCAGTTCTTT

51 TACTCCACCC GCTGATGTTG AGTGGTCAAC TGATGTTGAG GGCGCGGAAG 

101 CACTCGTCGA GTTTGCGGGT CGTGCCTGCT ACGAAACTTT TGATAAGCCG 

151 AACCCTCGAA CTGCTTCCAA TGCTGCGTAT CTGCGCCACA TCATGGAAGT 

201 GGGGCACACT GCTTTGCTTG AGCATGCCAA TGCCACGATG TATATCCGAG 

251 GCATTTCTCG GTCCGCGACC CATGAATTGG TCCGACACCG CCATTTTTCC 

301 TTCTCTCAAC TGTCTCAGCG TTTCGTGCAC AGCGGAGAAT CGGAAGTAGT 

351 GGTGCCCACT CTCATCGATG AAGATCCGCA GTTGCGTGAA CTTTTCATGC 

401 ACGCCATGGA TGAGTCTCGG TTCGCTTTCA ATGAGCTGCT TAATGCGCTG 

451 GAAGAAAAAC TTGGCGATGA ACCGAATGCA CTTTTAAGGA AAAAGCAGGC 

501 TCGTCAAGCA GCTCGCGCTG TGCTGCCCAA CGCTACAGAG TCCAGAATCG 

551 TGGTGTCTGG AAACTTCCGC ACCTGGAGGC ATTTCATTGG CATGCGAGCC 

601 AGTGAACATG CAGACGTCGA AATCCGCGAA GTAGCGGTAG GATGTTTAAG

651 AAAGCTGCAG GTAGCAGCGC CAACTGTTTT CGGTGATTTT GAGATTGAAA 

701 CTTTGGCAGA CGGATCGCAA ATGGCAACAA GCCCGTATGT CATGGACTTT 

751 TAA 

ORF2 amino acid sequence (SEQ ID NO: 16)

GTGGCCGAACAAGTTAAATTGAGCGTGGAGTTGATAGCGTGCAGTTCTTTTACTCCACCC
1 + + + + + + 60

MAEQVKLSVELIACSSFTPP 

GCTGATGTTGAGTGGTCAACTGATGTTGAGGGCGCGGAAGCACTCGTCGAGTTTGCGGGT
61 + + + + + + 120

ADVEWSTDVEGAEALVEFAG

CGTGCCTGCTACGAAACTTTTGATAAGCCGAACCCTCGAACTGCTTCCAATGCTGCGTAT
121 + + + + + + 180

RACYETFDKPNPRTASNAAY

CTGCGCCACATCATGGAAGTGGGGCACACTGCTTTGCTTGAGCATGCCAATGCCACGATG
181 + + + + + + 240

LRHIMEVGHTALLEHANATM

TATATCCGAGGCATTTCTCGGTCCGCGACCCATGAATTGGTCCGACACCGCCATTTTTCC
241 + + + + + + 300

YIRGISRSATHELVRHRHFS

TTCTCTCAACTGTCTCAGCGTTTCGTGCACAGCGGAGAATCGGAAGTAGTGGTGCCCACT 
301 + + +- + + + 360 

FSQLSQRFVHSGESEVVVPT 

CTCATCGATGAAGATCCGCAGTTGCGTGAACTTTTCATGCACGCCATGGATGAGTCTCGG 
361 + + + + + + 420 

LIDEDPQLRELFMHAMDESR 

TTCGCTTTCAATGAGCTGCTTAATGCGCTGGAAGAAAAACTTGGCGATGAACCGAATGCA
421 + + + + + + 480

FAFNELLNALEEKLGDEPNA

CTTTTAAGGAAAAAGCAGGCTCGTCAAGCAGCTCGCGCTGTGCTGCCCAACGCTACAGAG
481 + + + + + + 540

LLRKKQARQAARAVLPNATE

TCCAGAATCGTGGTGTCTGGAAACTTCCGCACCTGGAGGCATTTCATTGGCATGCGAGCC
541 + + + + + + 600

SRIVVSGNFRTWRHFIGMRA

AGTGAACATGCAGACGTCGAAATCCGCGAAGTAGCGGTAGGATGTTTAAGAAAGCTGCAG
601 + + + + + + 660

SEHADVEIREVAVGCLRKLQ

GTAGCAGCGCCAACTGTTTTCGGTGATTTTGAGATTGAAACTTTGGCAGACGGATCGCAA
661 + + + + + + 720 

VAAPTVFGDFEIETLADGSQ 

ATGGCAACAAGCCCGTATGTCATGGACTTTTAA 
721 + + +--- 753 

MATSPYVMDF* 

1 50
ATCC 13032 V 
N13 C 
ATCC 21529 C 

Consensus MALVVQKYGG SSLESAERIR NVAERIVATK KAGNDVVVVC SAMGDTTDEL 

51 100 

ATCC 13032 
N13 

ATCC 21529 

Consensus LELAAAVNPV PPAREMDMLL TAGERISNAL VAMAIESLGA EAQSFTGSQA 
101 150 

ATCC 13032 
N13 

ATCC 21529 

Consensus GVLTTERHGN ARIVDVTPGR VREALDEGKI CIVAGFQGVN KETRDVTTLG 
151 200

ATCC 13032 
N13 

ATCC 21529 

Consensus RGGSDTTAVA LAAALNADVC EIYSDVDGVY TADPRIVPNA QKLEKLSFEE

201 250 

ATCC 13032 
N13 

ATCC 21529 

Consensus MLELAAVGSK ILVLRSVEYA RAFNVPLRVR SSYSNDPGTL IAGSMEDIPV 
251 300 

ATCC 13032 
N13 

ATCC 21529 

Consensus EEAVLTGVAT DKSEAKVTVL GISDKPGEAA KVFRALADAE INIDMVLQNV 

301 350 

ATCC 13032 S G 

N13 A D 

ATCC 21529 A G 

Consensus SSVEDGTTDI TFTCPRADGR RAMEILKKLQ VQGNWTNVLY DDQVGKVSLV 

351 400 
ATCC 13032 T 
N13 T 
ATCC 21529 I 

Consensus GAGMKSHPGV TAEFMEALRD VNVNIELIST SEIRISVLIR EDDLDAAARA 

401 421 

ATCC 13032 
N13 

ATCC 21529 

Consensus LHEQFQLGGE DEAVVYAGTG R 
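
The alignment above marks, for each strain, only the residues that differ from the consensus row. As a hedged sketch of how such differences can be located, the helper below (the name `variant_positions` is illustrative and not from the source) compares two equal-length aligned segments and reports the 1-based positions where they disagree. The example uses residues 361 to 380: the consensus row from the alignment and the ATCC 21529 residues from the ask translation block.

```python
def variant_positions(consensus: str, strain: str, start: int = 1):
    """Return (position, consensus_residue, strain_residue) for each mismatch.

    Both strings must already be aligned and of equal length; `start` is the
    numbering of the first residue in the segment.
    """
    if len(consensus) != len(strain):
        raise ValueError("aligned segments must have the same length")
    return [
        (start + i, c, s)
        for i, (c, s) in enumerate(zip(consensus, strain))
        if c != s
    ]


consensus_361_380 = "TAEFMEALRDVNVNIELIST"   # consensus row, spaces removed
atcc21529_361_380 = "TAEFMEALRDVNVNIELISI"   # ask translation block, bases 1081-1140
print(variant_positions(consensus_361_380, atcc21529_361_380, start=361))
# [(380, 'T', 'I')]
```

This agrees with the single `I` shown for ATCC 21529 in the 351 to 400 block of the alignment. The function does not handle gaps or unequal lengths, which a real alignment tool would.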

HpaI-PvuII fragment comprising the PI promoter (SEQ ID NO: 17)



AACCGGTGTGGAGCCGACCATTCCGCGAGGCTGCACTGCAACGAGGTCGTAGTTTTGGTACATGGCTTCTG
GCCAGTTCATGGATTGGCTGCCGAAGAAGCTATAGGCATCGCCACCAGGGCCACCGGAGTTACCGAAGATG 
GTGCCGTGCTTTTCGCCTTGGGCAGGGACCTTGACAAAGCCCACGCTGATATCGCCAAGTGAGGGATCAGA 
ATAGTGCAtGGGCACGTCGATGCTGCCACATTGAGCGGAGGCAATATCTACCTGAGGTGGGCATTCTTCCC 
AGCGGATGTTTTCTTGCGCTGCTGCAGTGGGCATTGATACCAAAAAGGGGCTAAGCGCAGTCGAGGCGGCA 
AGAACTGCTACTACC I I 1 1 1 lATTGTCGMCGGGGCATOCGGCtCCAAGGACGtTTGTTTTCTGGGTCAG 
TTACCCCAAAMGCATATACAGAGACCMTGATTTTTCATTAAAMGGCAGGGATTTGTTATAAGT^ 
TCGTATTCTGTGCGACGGGTGTACCTCGGCTAGAATTTCTCCCCATGACACCAG



Nucleotide sequence of truncated ORF2 (SEQ ID NO: 18)



1 GTGGCCGAAC AAGTTAAATT GAGCGTGGAG TTGATAGCGT GCAGTTCTTT 

51 TACTCCACCC GCTGATGTTG AGTGGTCAAC TGATGTTGAG GGCGCGGAAG 

101 CACTCGTCGA GTTTGCGGGT CGTGCCTGCT ACGAAACTTT TGATAAGCCG 

151 AACCCTCGAA CTGCTTCCAA TGCTGCGTAT CTGCGCCACA TCATGGAAGT

201 GGGGCACACT GCTTTGCTTG AGCATGCCAA TGCCACGATG TATATCCGAG

251 GCATTTCTCG GTCCGCGACC CATGAATTGG TCCGACACCG CCATTTTTCC 

301 TTCTCTCAAC TGTCTCAGCG TTTCGTGCAC AGCGGAGAAT CGGAAGTAGT 

351 GGTGCCCACT CTCAT 

Truncated ORF2 amino acid sequence (SEQ ID NO: 19)

GTGGCCGAACAAGTTAAATTGAGCGTGGAGTTGATAGCGTGCAGTTCTTTTACTCCACCC
1 + + + + + + 60 

MAEQVKLSVELIACSSFTPP 

GCTGATGTTGAGTGGTCAACTGATGTTGAGGGCGCGGAAGCACTCGTCGAGTTTGCGGGT 
61 + + + + + + 120

ADVEWSTDVEGAEALVEFAG

CGTGCCTGCTACGAAACTTTTGATAAGCCGAACCCTCGAACTGCTTCCAATGCTGCGTAT
121 + + + + + + 180

RACYETFDKPNPRTASNAAY

CTGCGCCACATCATGGAAGTGGGGCACACTGCTTTGCTTGAGCATGCCAATGCCACGATG
181 + + + + + + 240 

LRHIMEVGHTALLEHANATM 

TATATCCGAGGCATTTCTCGGTCCGCGACCCATGAATTGGTCCGACACCGCCATTTTTCC 
241 + + + + + + 300

YIRGISRSATHELVRHRHFS

TTCTCTCAACTGTCTCAGCGTTTCGTGCACAGCGGAGAATCGGAAGTAGTGGTGCCCACT
301 + + + + + + 360

FSQLSQRFVHSGESEVVVPT 

CTCAT ... 
361 

L (I) 

Sequence of truncated LysA ('LysA) (NRRL-B11474) (SEQ ID NO: 20)

ATGGCTACAGTTGAAAATTTCAATGAACTTCCCGCACACGTATGGCCACGCAATGCAGTG 
CGCCAAGAAGACGGCGTTGTCACCGTCGCTGGTGTGCCTCTGCCTGACCTCGCTGAAGAA 
TACGGAACCCCACTGTTCGTAGTCGACGAGGACGATTTCCGTTCCCGCTGTCGCGACATG 
GCTACCGCATTCGGTGGACCAGGCAATGTGCACTACGCATCCAAAGCGTTCCTGACCAAG 
ACCATTGCACGTTGGGTTGATGAAGAGGGGCTGGCACTGGACATTGCGTCCATCAATGAA
CTGGGCATTGCCCTGGCCGCTGGTTTCCCGGCCAGCCGTATCACCGCGCACGGCAACAAC
AAAGGCGTAGAGTTCCTGCGCGCGTTGGTTCAAAACGGTGTCGGGCATGTGGTGCTGGAC
TCCGCGCAGGAATTGGAACTGCTGGATTACGTTGCCGCTGGTGAAGGCAAGATCCAGGAC
GTGTTGATCCGCGTGAAGCCAGGTATCGAAGCCCACACCCACGAGTTCATCGCCACTAGC
CACGAAGACCAGAAGTTCGGATTCTCCCTGGCATCCGGTTCCGCATTCGAAGCAGCGAAA
GCAGCCAACAATGCAGAGAACTTGAACCTGGTTGGTCTGCACTGCCATGTTGGTTCCCAG
GTGTTCGACGCCGAAGGCTTCAAGCTGGCAGCAGAGCGCGTGTTGGGCCTGTACTCACAG
ATCCACAGCGAACTAGGTGTCGCCCTTCCTGAGCTGGACCTCGGTGGCGGATACGGCATC
GCCTACACTGCAGATGAGGAACCACTCAACGTCGCAGAAGTCGCCTCCGACCT

Truncated sequence of LysA (NRRL-B11474)

DIAMINOPIMELATE DECARBOXYLASE (LysA) (SEQ ID NO: 21) 

MATVENFNELPAHVWPRNAVRQEDGVVTVAGVPLPDLAEEYGTPLFVVDEDDFRSRCRDM 
ATAFGGPGNVHYASKAFLTKTIARWVDEEGLALDIASINELGIALAAGFPASRITAHGNN 
KGVEFLRALVQNGVGHVVLDSAQELELLDYVAAGEGKIQDVLIRVKPGIEAHTHEFIATS 
HEDQKFGFSLASGSAFEAAKAANNAENLNLVGLHCHVGSQVFDAEGFKLAAERVLGLYSQ 
IHSELGVALPELDLGGGYGIAYTADEEPLNVAEVASDL 